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Diabetes mellitus represents a group of complex metabolic diseases that result in impaired 
glucose homeostasis, which includes destruction of p-cells or the failure of these insulin- 
secreting cells to compensate for increased metabolic demand. Despite a strong interest 
in characterizing the transcriptome of the different human islet cell types to understand the 
molecular basis of diabetes, very little attention has been paid to the role of long non-coding 
RNAs (IncRNAs) and their contribution to this disease. Here we summarize the growing 
evidence for the potential role of these IncRNAs in p-cell function and dysregulation in 
diabetes, with a focus on imprinted genomic loci. 
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INTRODUCTION 

Recent technological advances in the field of genome sequencing 
have paved the way for a new appreciation of non-coding RNAs 
in gene regulation. Ultra high-throughput transcriptome analyses 
have revealed that the vast majority of the genome is transcribed, 
with two-thirds of the human genome covered by processed tran- 
scripts, of which only a small fraction (<2%) is translated into 
proteins (Djebali etal., 2012). The identification of several com- 
mon genomic and functional features of these untranslated RNAs 
has led to their categorization into various classes of non-coding 
RNAs. One such class that has been the focus of extensive research 
is that of long non-coding RNAs (IncRNAs). 

LncRNAs are defined as transcripts longer than 200 bp that 
lack protein-coding potential (Guttman et al., 2009; Derrien et al., 
2012; Batista and Chang, 2013; Fatica and Bozzoni, 2014). Like 
messenger RNAs, IncRNAs typically have multiple exons, are 
processed using canonical splice sites, and may exist as sev- 
eral isoforms (Ponjavic etal., 2007; Cabili etal., 2011; Derrien 
etal., 2012). In contrast to mRNAs, IncRNAs preferentially dis- 
play nuclear localization, consistent with their proposed function 
in chromatin organization and regulation of gene expression 
(Khalil et al, 2009; Zhao et al, 2010; Derrien et al, 2012; Guttman 
and Rinn, 2012; Rinn and Chang, 2012; Fatica and Bozzoni, 
2014). 

Similar to protein-coding genes, IncRNA-encoding genes are 
marked by chromatin signatures typical of active transcription in 
the cell types where they are expressed, consisting of H3K4me3 
(trimethylated lysine 4 in histone H3) at the promoter, followed 
by H3K36me3 along the transcribed regions (so-called "K4-K36 
domains"; Guttman etal, 2009; Khalil etal, 2009; CabiH etal, 
2011; Guttman and Rinn, 2012; Rinn and Chang, 2012). While 
IncRNA exons display weaker evolutionary conservation than 



those of protein-coding genes, there is evidence of positive selec- 
tion for a subset of IncRNAs, which may be driven by constraints to 
maintain secondary structure required for functional interactions 
with their targets (Ponjavic et al., 2007; Guttman et al., 2009; Cabili 
et al. , 20 1 1 ; Ulitsky et al. , 20 1 1 ; Derrien et al. , 20 1 2) . In contrast, the 
promoters of IncRNAs are as highly conserved as those of protein- 
coding genes (Carninci et al, 2005; Ponjavic et al., 2007; Guttman 
etal, 2009; Derrien etal, 2012; Batista and Chang, 2013). Despite 
their overall lower expression levels, IncRNAs exhibit a higher 
degree of tissue specificity compared to average protein-coding 
genes (Mercer etal, 2008; CabiU etal, 2011; Derrien etal, 2012; 
Batista and Chang, 2013; Fatica and Bozzoni, 2014). 

Through numerous studies, several general principles of 
IncRNA function have emerged. LncRNAs have been shown to 
function both in cis, i.e., locally close to the site of their production, 
and in trans, i.e., at sites on other chromosomes. LncRNAs have 
been proposed to act as scaffolds for chromatin modifiers, block- 
ers of transcription, antisense RNAs, microRNA sponges, protein 
decoys, and enhancers (Cech and Steitz, 2014; Fatica and Bozzoni, 
2014). In fact, the act of transcription of a IncRNA itself can inter- 
fere with the regulatory function of a regulatory DNA sequence, 
as exemplified in yeast (Martens et al., 2004) and in mammalian 
imprinting (Latos et al, 2012). As a result of their diverse functions 
in multiple tissues, mis-regulation of IncRNAs can lead to failure of 
normal development and, consequently, to disease. Mammalian 
chromatin modifiers such as the repressive polycomh complexes 
often lack their own specific DNA-binding domains but instead 
contain RNA-binding elements. LncRNAs can play critical roles 
in directing these repressive chromatin modifying complexes to 
their target regions (Bernstein and Allis, 2005; Rinn etal., 2007; 
Zhao etal., 2010). One such example is the Foxfl -adjacent, non- 
coding developmental regulatory RNA (Fendrr), a IncRNA that 
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interacts with the polycomb repressive complex 2 (PRC2) and is 
critical for heart development and function (Grote etal., 2013). 
Similarly, the well-characterized HOTAIR IncRNA, which is tran- 
scribed from the HOXC locus, is highly upregulated in primary 
breast tumors and was shown to function through the silencing 
of tumor suppressor genes in a PRC2-dependent manner [Gupta 
et al, 2010; See Maass et al. (2014) for a list of IncRNAs currently 
implicated in human diseases] . Taken together, these features sug- 
gest that IncRNAs and other non-coding RNA species may play 
an essential role in defining organismal complexity (Mattick and 
Makunin, 2006; Taft et al, 2007). 

These findings raise the possibility that IncRNAs and other non- 
coding RNAs may be exciting molecular candidates to account for 
the unresolved genetic risk in complex diseases such as diabetes 
(Medici etal., 1999; Hyttinen etal., 2003). Diabetes mellitus repre- 
sents a group of metabolic diseases that result in impaired glucose 
homeostasis. In the case of type 1 diabetes (TID), metabolic 
impairment is the result of autoimmune destruction of insulin- 
producing pancreatic P-cells. In type 2 diabetes (T2D), the most 
prevalent form of the disease, the defect in glucose metabolism is 
the result of decreased sensitivity of peripheral tissues to insulin 
action, accompanied by failure of fi-cells to compensate for the 
increased metabolic demand (Zimmet et al., 2001). Together, these 
diseases affect over 25 million Americans and account for $176 
billion in healthcare costs per year in the US alone (Association, 
2013), necessitating the pursuit of more effective and personalized 
treatments. 

Significant efforts have been made to attain a better understand- 
ing of the causes of diabetes at the molecular level. Linkage analysis 
of affected families led to the successful identification of causal 
gene mutation in several rare, Mendelian forms of the disease 
(Fajans etal., 2001; O'Rahilly, 2009). However, large-scale efforts 
to identify DNA variants associated with more common forms 
of diabetes through genome-wide association studies (GWAS) 
have predominantly identified candidate variants that lie in non- 
coding regions and with as yet unknown functions (McCarthy, 
2010). Thus, to improve our current understanding of the molec- 
ular basis of diabetes mellitus and to develop better treatment 
strategies, we need to carefully characterize the transcriptome of 
pancreatic P-cells, with a focus on elucidating the functions of 
non-coding transcripts. In this review, we present a summary of 
recent evidence for a role of IncRNAs in the regulation of P-cell 
function and their potential contribution to the pathogenesis of 
diabetes. 

p-CELL IncRNAs 

The most comprehensive catalog of human IncRNAs expressed in 
P-cells published thus far is that by Moran etal. (2012). In this 
study, the authors profiled whole islet and FACS-sorted P-cells 
and identified 1,128 distinct transcripts that displayed many of 
the typical properties of IncRNAs described above, including the 
"K4-K36" histone modification domains, lack of protein-coding 
potential, and non-uniform expression levels among tissues. Most 
notably, the IncRNAs identified were roughly five times more 
islet-specific compared to general protein-coding genes, and the 
vast majority had orthologous genes in the mouse genome. Ku 
etal. (2012) similarly characterized mouse islet- andp-ceU-specific 



transcripts and identified 1,359 high-confidence IncRNAs with 
several of the aforementioned properties. Using high-throughput 
transcriptome analysis of sorted human islets, IncRNAs expressed 
in a-ceUs have also been identified (Bramswig et al., 2013). 

Of particular interest was the fact that IncRNAs were often 
found in proximity to critical islet-specific transcription factors 
(Ku etal., 2012; Moran etal., 2012). Thus, protein-coding genes 
adj acent to islet-enriched IncRNAs were also more likely to be islet- 
specific than the average protein-coding gene (Moran et al., 2012). 
This correlation has led to the suggestion that IncRNAs and nearby 
protein-coding genes share common regulatory elements. Indeed, 
IncRNAs were often found in large regions of open chromatin 
that were uniquely associated with protein-coding genes expressed 
highly in islets (Gaulton et al., 2010). 

The temporal expression of islet IncRNAs has also been stud- 
ied by Moran etal. (2012) in human embryonic pancreases as 
well as in a stepwise in vitro fi-cell differentiation model using 
human embryonic stem (ES) cells (developed by Kroon etal., 
2008). Unlike some IncRNAs that are known to be critical to early 
stages of embryonic development (Guttman etal., 2011; Grote 
et al., 2013), the expression of a majority of islet IncRNAs identi- 
fied in this study (Moran et al., 2012) is restricted to differentiated, 
mature endocrine cells. The orthologous mouse IncRNAs (e.g., 
Mi-Lnc80) exhibit similar cell- and stage-specific expression. 

The characteristics of these islet IncRNAs imply a role for these 
RNAs in mature P-ceU function. To test this hypothesis, Moran 
etal. (2012) used short hairpin RNAs (shRNAs) to suppress the 
activity of one such IncRNA transcript in the human EndoC-fiHl 
P-cell line (Ravassard etal., 2011). From a panel of known islet- 
specific transcripts, the authors identified GLIS3 as a downstream 
target of HI-LNC25, a IncRNA that shares a regulatory domain 
with MAFB. Variants at the GLIS3 locus are associated with dif- 
ferent risks for TID (Barrett etal., 2009), elevated fasting glucose 
levels (Dupuis et al, 2010), as well as T2D (Cho et al, 2012). Loss- 
of function studies suggest that GLIS3 encodes a transcription 
factor critical for regulating the expression of insulin and sev- 
eral key islet-transcription factors, and may confer risk for both 
TID and T2D by resulting in diminished P-cell numbers and by 
promoting the formation of a pro-apoptotic splice variant of the 
protein Bim (Kang et al., 2009; Nogueira et al, 2013; ZeRuth et al., 
2013). However, the shRNA-mediated decrease in GLIS3 mRNA 
levels had no impact on glucose-stimulated insulin secretion or 
insulin transcript levels in the transduced EndoC-PHl fi-ceU line, 
possibly because this cell line does not recapitulate aU aspects of 
P-ceU function in vivo. Additionally, only a minor fraction of P-cell 
expressed IncRNAs was responsive to elevated glucose levels in 
human islets. 

As previously noted, several risk variants for common forms 
of diabetes identified by GWAS do not change the protein-coding 
potential of known genes, suggesting that they might affect as 
yet unidentified regulatory elements (McCarthy, 2010). Using a 
computational tool known as MAGENTA to search for enrichment 
of genetic associations in a predefined set of genes (Segre etal., 
2010), Moran et al. (2012) determined that the islet IncRNA genes 
identified in their study were in fact highly enriched for risk alleles 
associated with T2D and related phenotypes, further underscoring 
the need to interrogate the function of these RNAs in P-cell biology. 
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Overall, these studies highlighted IncRNAs as a major com- 
ponent of the P-cell transcriptome that is cell-type-specific, 
developmentally regulated, and evolutionarily conserved with 
strong associations to disease risk. However, it still remains to 
be determined how these IncRNAs may contribute to P-cell fijnc- 
tion, and if their mis-regulation may play a role in diabetes. Their 
expression in EndoC-PHl cells and mouse islets provides addi- 
tional platforms to evaluate their function in a systematic and 
comprehensive manner. Future studies will also need to address 
the question of whether the IncRNAs identified thus far act in cis 
(on neighboring islet protein-coding genes) or in trans to exert 
their function. 

IMPRINTING 

Some of the best characterized IncRNAs to date were first uncov- 
ered in early studies of imprinting and dosage compensation of the 
X-chromosome (Brannan etal., 1990; Brown etal, 1991; Fatica 
and Bozzoni, 2014). Imprinting refers to the biased expression of 
genes depending on the parental origin of the chromosome. This 
process is tightly regulated, typically through epigenetic modifi- 
cations such as DNA methylation at ds-acting elements known 
as "imprinting control regions" (ICRs), to establish and main- 
tain mono-allelic expression of specific genes (Thorvaldsen and 
Bartolomei, 2007). Methylation at the ICRs is maintained despite 
active demethylation and dynamic reprogramming in the newly 
formed zygote, and is only altered during establishment of methy- 
lation pattern in a sex-specific manner during primordial germ cell 
development (Bartolomei and Ferguson-Smith, 2011). Imprinted 
loci are generally found in large clusters, where both maternally- 
and paternally expressed genes are interspersed. Frequently, the 
protein-coding genes are expressed from one parental allele, while 
non-coding genes are expressed from the other (Barlow, 2011). 
LncRNAs play an essential role in the regulation of mono-allelic 
expression, either by acting in cis as an antisense molecule to block 
the transcriptional machinery, or by directly recruiting repressive 
chromatin modifiers to silence reciprocally expressed genes (Lee 
and Bartolomei, 2013). 

While imprinting is most extensively studied in the context of 
fetal development, tissue-specific regulation in adult tissues has 
also been observed (Barlow, 2011; Lee and Bartolomei, 2013). As 
a result, several imprinted genes are also implicated in human dis- 
eases that arise from somatic tissues. One such example is that 
of the maternally expressed adipose tissue transcription factor, 
KLF14 (Parker- Katiraee et al., 2007), which is associated with risk 
for both T2D and high-density lipoprotein disorders (Teslovich 
etal, 2010; Voight etal, 2010; Small etal, 2011). Perhaps the 
functionally haploid nature of these loci results in their increased 
likelihood to be associated with susceptibility to disease, as muta- 
tions in these genes, when found on the maternal chromosome 
that is expressed, cannot be "covered" by the gene from the 
other, silenced paternal allele. This may be particularly true for 
metabolic disorders, as several imprinted genes encode dosage- 
sensitive proteins related to growth factors and energy metabolism. 
Interestingly, several risk variants for type 1 and type 2 diabetes 
identified through GWAS are located in imprinted loci including 
KCNQl, MEG3, PLAGLl, and GRBIO. A few of these are discussed 
below in the context of islet and P-ceU function. 



DLK1-MEG3 LOCUS 

Recently, we identified the maternally expressed non-coding RNAs 
of the imprinted DLK1-MEG3 locus as down-regulated in human 
islets from T2D donors (Kameswaran et al, 2014). This gene clus- 
ter is located on human 14q32 (mouse chromosome 12) and 
contains three paternally expressed protein-coding genes, DLKl, 
RTLl, and DI03. DLKl is a non-canonical Notch ligand that 
is expressed in many embryonic tissues (Falix etal., 2012) and 
is a well-established negative regulator of adipocyte differenti- 
ation (Smas and Sul, 1993; Mitterberger etal, 2012; Abdallah 
etal., 2013). DLKl is highly expressed in human and mouse P- 
cells (Tornehave etal., 1996; Dorrell etal., 2011; Appelbe etal., 
2013). While DLKl was demonstrated to be stimulated by growth 
hormone and prolactin expression in rat islets, including dur- 
ing pregnancy, it is not directly responsible for the mitogenic 
effects of these hormones on islets (Carlsson et al., 1997; Friedrich- 
sen etal, 2003). Additionally, loss of expression of Dlkl in 
unchallenged mouse P-cells does not cause any observable phe- 
notype (Appelbe etal, 2013). Rtll {Retrotransposon-like 1) is 
critical for normal placental development and its loss results in 
severe developmental defects and late-fetal lethality (Sekita et al., 
2008). 

The maternally expressed genes are all non-coding RNAs, con- 
sisting of the IncRNA, Maternally Expressed Gene 3 {MEG3, known 
as Gtl2 in mice), as well as a large cluster of microRNAs (miRNAs) 
and snoRNAs (Schmidt etal, 2000; Seitz etal, 2004; da Rocha 
etal., 2008). In several tissues, including human islets, the non- 
coding RNAs are all derived from a single transcript that initiates 
from the MEG3 promoter (Tierling etal., 2006; da Rocha etal., 
2008; Kameswaran et al, 2014). 

Reciprocal imprinting is established by methylation of two 
differentially methylated regions (DMRs) on the paternal allele, 
one located ~13 kb upstream of the MEG3 transcription start 
site (IG-DMR), and the other overlapping with the promoter 
of the MEG3 poly cistronic transcript (MEG3-DMR; Figure 1). 
While the IG-DMR is the primary ICR for this imprinted clus- 
ter, the MEG3-DMR is also critical to regulating and maintaining 
imprinting at this region (Kagami et al., 2010). Failure to maintain 
imprinting at this locus can lead to either maternal or paternal uni- 
parental disomy (UPD) of chromosome 14, which causes distinct 
and severe developmental disorders (Kagami etal., 2008). 

Increased methylation of the MEG3-DMR and related loss of 
MEG3 expression has been observed in several human cancers, 
such as pituitary and renal cell cancers and multiple myeloma 
(Zhao etal, 2005; Kawakami etal, 2006; Benetatos etal, 2008) 
to name a few (further reviewed by Benetatos et al, 201 1). These 
studies, coupled with in vitro experiments, suggest that MEG3 
functions as a tumor suppressor by activating p53, in a manner 
dependent upon the secondary structure of the MEG3 RNA (Zhou 
etal., 2007, 2012). Furthermore, decreased expression of MEG3 
and hypermethylation of the DMRs may single-handedly explain 
the subtle phenotypic differences between induced pluripotent 
stem cells (iPSCs) and ES cells, such as the decreased efficiency in 
generating chimeric mice from iPSCs (Stadtfeld etal., 2010). 

Similar to the aforementioned examples, decreased expression 
of MEG3 and the associated miRNAs in T2D islets strongly cor- 
relates with hypermethylation of the MiiG3-DMR (Kameswaran 
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FIGURE 1 I Proposed model of imprinting at the DLK1-MEG3 locus: 

tine DLK1-MEG3 imprinted region contains a primary ICR (IG-DMR) and 
secondary {MEG3-DMR) ICR tinat overlaps with the promoter of the MEG3. 
Both ICRs are paternally methylated. In mouse ES cells, the Meg3 IncRNA 
is believed to direct PRC2 mediated silencing of DIkl (Zhao etal., 2010). 



etal, 2014). Additionally, a single nucleotide polymorphism 
(SNP) (rs941576) located in an intron of MEG3 was found to be 
associated with TID, with the risk allele being transmitted more 
frequently from the father than the mother of the affected offspring 
(Wallace etal., 2010). Overall, these examples provide compelling 
evidence for the importance of MEG3 and the regulation of this 
imprinted region in several diseases. Despite the strong disease 
association of this IncRNA, and the fact that genes in this imprinted 
cluster are very highly expressed in human P-cells (Dorrell et al., 
2011; Kameswaran et al., 2014), there are currently no postulated 
mechanisms for its potential role in P-cell function and diabetes 
pathogenesis. 

Recent studies have suggested that similar to other nuclear IncR- 
NAs, MEG3 also directly interacts with the PRC2 complex in ES 
cells to guide the repressive histone modification mark H3K27me3 
to its target sites (Zhao et al., 2010; Kaneko et al, 2014). One study 
identified Dlkl as a direct target of the Me^J-PRC2 complex in 
mouse ES cells (Figure 1 ) , although this finding could not be repli- 
cated in M£G3-expressing human iPSCs, where MEGS was found 
to function in trans (Zhao etal, 2010; Kaneko etal., 2014). A 
careful characterization of MiiG3-PRC2 complex targets in adult 
pancreatic islets will provide better insights into the role of this 
IncRNA in P-cell function. 

KCNQ1 LOCUS 

The KCNQl gene, encoding a voltage-gated potassium chan- 
nel, has been of great interest to the P-cell biology field due 
to its strong disease association. The gene is located in an 
imprinted locus on human llplS.S, adjacent to another inde- 
pendently regulated imprinted locus, H19-IGF2. This region was 
implicated as a molecular candidate for Beckwith-Wiedemann 



syndrome (BWS), a disorder characterized by prenatal macro- 
somia, predisposition for tumor development and frequently, 
hyperinsulinemic hypoglycemia (Lee etal., 1997, 1999; Hussain 
etal., 2005). This imprinted region consists of several con- 
served, maternally expressed protein-coding genes, such as the 
cell cycle inhibitor CDKNIC, and a paternally expressed anti- 
sense IncRNA, KCNQl overlapping transcriptl {KCNQIOTI; 
Monk etal., 2006). Loss of imprinting in this locus can lead 
to the suppression of CDKNIC, which is sufficient to cause re- 
entry of adult human P-cells into the cell cycle (Avrahami et al., 
2014). 

Imprinting of this region is maintained by a maternally methy- 
lated ICR, known as the KvDMR, which is also the promoter 
for KCNQIOTI (Figure 2). To maintain appropriate mono- 
allelic expression of imprinted genes in this locus, the KvDMR 
is hypomethylated on the paternal allele, leading to expression 
of the KCNQIOTI IncRNA and subsequent repression of the 
maternal, protein-coding genes on the same allele (Fitzpatrick 
etal., 2002; IderaabduUah etal., 2008), possibly by facilitating 
intra-chromasomal looping to direct the repressive PRC2 com- 
plex to their promoter (Figure 2; Zhao etal, 2010; Zhang etal., 
2014). 

The KCNQl locus harbors at least two independently identi- 
fied and replicated GWAS signals at SNPs located in the intron 
of the KCNQl gene (rs2237892), with one overlapping the 
KCNQIOTI IncRNA (rs231362; Unoki etal, 2008; Yasuda etal, 
2008; Kong etal, 2009; Voight etal, 2010). Additional SNPs in 
this gene, such as rs2237895, are also reported to be associated 
with T2D risk in specific ethnic populations (Unoki etal, 2008). 
While these SNPs are predicted to confer risk for diabetes only 
when maternally inherited (Kong etal., 2009), the risk alleles do 
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FIGURE 2 I Proposed model of imprinting at the KCNQl locus: the 

KCNQIOTI IncRNA is expressed from the paternally unmethylated KvDMR 
ICR, which is methylated on the maternal allele. Recent evidence suggests 
that KCNQIOTI can directly recruit the PRC2 complex and facilitate 
intra-chromosomal looping to the KCNQl promoter (Zhang etal., 2014). 
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not correlate with each other (Kong etal, 2009; Voight etal., 
2010) and have opposing effects on docking of insulin granules 
(Rosengren et al., 2012). 

To investigate how these T2D risk variants may affect allelic 
expression and imprinting of this region, Travers etal. (2013) 
correlated the risk SNP genotypes with DNA methylation and 
expression patterns of the imprinted genes in human fetal pancreas 
and adult islets. This study revealed that fetal samples homozygous 
for the rs2237895 risk allele had marginally increased methy- 
lation levels at the KvDMR region. As this was not observed 
in the adult, these results suggest that effects of the risk allele 
are likely be established during early stages of islet develop- 
ment, as KCNQl and KCNQIOTI are only imprinted in fetal 
but not adult tissues (Monk etal., 2006; Travers etal, 2013). 
Overall, this study proposes a model whereby each risk allele 
for the rs2237895 SNP leads to increased methylation of the 
KvDMR, and consequently, decreased expression of KCNQIOTI. 
However, there was no observable difference in KCNQl or 
KCNQIOTI expression in samples used for this study. On the 
contrary, KCNQIOTI transcript levels have been shown to be 
significantly elevated in T2D islets (where SNP genotype was 
not determined; Moran etal., 2012), which parallels an overall 
decrease in methylation at several tested CpGs near the KCNQl 
gene (Dayeh etal., 2014). Thus, the interpretation of variants 
to disease pathology at this region has been contradictory and 
challenging. Nevertheless, the regulation of this locus and the 
IncRNA KCNQIOTI remains relevant to P-cell biology and T2D 
pathogenesis. 

H19-IGF2 LOCUS 

The H19-IGF2 locus resides adjacent to the KCNQl region 
on human llplS.S. The region consists of the paternally 
expressed insulin-like growth factor 2 {IGF2) gene and mater- 
nally expressed H19 IncRNA (Brannan etal., 1990; DeChiara 
etal, 1990; Bartolomei etal, 1991). The IGF2 protein func- 
tions as a growth factor essential for embryonic develop- 
ment (DeChiara etal, 1990), whereas H19 may function as 
a tumor suppressor (Hao etal., 1993). Imprinting at this 
locus is maintained by an ICR, which is selectively methy- 
lated on the paternal allele. The insulator protein, CCCTC- 
binding factor (CTCF), binds to critical regulatory regions in 
the unmethylated ICR on the maternal allele, thus blocking 
access of downstream enhancers to the IGF2 promoter (Figure 3; 
Stadnick etal, 1999; Bell and Felsenfeld, 2000; Engel etal, 

2004) . 

Loss of methylation at the H19/IGF2 ICR results in short body 
length and low birth weight, both in rodent models (DeChiara 
etal., 1990) as well as in humans, such as patients with SUver- 
Russell syndrome, a developmental disorder characterized by 
intrauterine and postnatal growth retardation (Gicquel etal., 

2005) . This has also been observed in humans who were peri- 
conceptually exposed to famine (Heijmans etal., 2008). There 
is growing evidence that intra-uterine exposure to malnutri- 
tion can predispose the offspring to metabolic complications 
including P-cell dysfunction and diabetes later in life (Ravelli 
etal., 1998; Roseboom etal., 2006). This theory is commonly 
referred to as the "thrifty phenotype hypothesis" (Hales, 2001) 



and is thought to be mediated primarily through environmentally 
induced epigenetic changes to key metabolic regulators (Park 
etal., 2008; Bramswig and Kaestner, 2012). However, first 
and second generation progeny of mice exposed to gestational 
diabetes were found to have impaired glucose tolerance with 
hypermethylation of the H19 ICR in islets (Ding etal, 2012). 
These contradicting observations may be a result of differ- 
ent nutrient availability that the developing fetus was exposed 
to, as well as the varying lengths of exposure. The above 
studies suggest that the H19-IGF2 locus is highly responsive 
to these changes in the intrauterine milieu and may repre- 
sent a prognostic marker of metabolic complications later in 
life. 

Hypermethylation of the H19-IGF2 ICR has been observed 
in some cases of BWS (Ohlsson etal, 1993), as well as in focal 
congenital hyperinsulinism (FoCHI), a glucose metabolism disor- 
der characterized by unbridled insulin secretion from hyperplastic 
islet cells and consequent hypoglycemia (de Lonlay etal, 1997). 
Increased methylation at this ICR would be predicted to result 
in decreased Hi 9 expression, loss of imprinting at this region 
and a concomitant increase in IGF2 expression. Although over- 
expression of IGF2 in mouse P-cells recapitulates the FoCHI 
phenotype (Devedjian etal., 2000), IGF2 expression was variable 
in human FoCHI lesions (Fournet etal., 2001). On the contrary. 
Hi 9 transcript levels were consistently down-regulated in these 
cells, suggesting that Hi 9 may have an important regulatory role 
in restraining islet-proliferation. This hyperproliferative pheno- 
type, accompanied by suppression of Hi 9 has also been reported 
in Wilms' tumor (Cui etal., 1997). Taken together, the Hi9lncRNA 
may function as a critical regulator of P-cell function and prolifer- 
ation either on its own or indirectly through the regulation of IGF2 
levels. 
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ZAC-HYMAI LOCUS 

Transient neonatal diabetes (TNDM) is a rare form of diabetes 
mellitus characterized by hyperglycemia and low insulin levels 
within the first year of birth (Temple etal., 2000). This form of 
diabetes is distinct from TID as there is no evidence for autoim- 
munity (Abramowicz etal, 1994; Shield etal, 1997). Although 
it usually resolves by 2 years of age, children with TNDM are 
at a higher risk of developing T2D later in life (Temple etal., 
2000). The molecular cause of this disease was identified to be 
abnormal imprinting of chromosome 6q24, which encompasses 
the cell cycle regulator, ZAC/PLAGLl, and the IncRNA, HYMAI 
(Abramowicz etal, 1994; Arima etal, 2000; Gardner etal, 2000; 
Kamiya etal., 2000; Mackay etal, 2002). Both ZAC and HYMAI 
share a common imprinted promoter (PI in Figure 4), which 
also serves as the ICR, and are expressed from the paternal allele 
(Arima etal., 2000; Mackay etal., 2002). However, tissue-specific 
usage of an alternative promoter (P2 in Figure 4) that drives 
biaUelic expression of ZAC has also been reported (Valleley et al., 
2007). 

ZAC encodes a zinc finger protein that regulates apoptosis and 
cell cycle arrest (Spengler etal., 1997). The protein is expressed 
at very high levels in insulin-producing cells in the human fetal 
pancreas, but not adult islets (Du et al., 201 1). ZAC can also func- 
tion as a transcriptional activator of CDKNIC and KCNQIOTI 
(Arima etal., 2005). ZAC is believed to control the induction of 
the pituitary adenylate cyclase-activating polypeptide (PACAP), a 
strong activator of glucose-stimulated insulin secretion (Yada et al., 
1994; Ciani etal., 1999). These features of the ZAC gene make 
this a strong candidate for the pathogenesis of TNDM. However, 
the mechanism of imprinting and the function of HYMAI in the 
context of TNDM have yet to be established. 



MALAT1, AN ABUNDANT IncRNA 

The metastasis-associated lung adenocarcinoma transcript 1 
(MALATl j is a highly conserved IncRNA that is mis-regulated 
in several tumors (Ji etal., 2003; Gutschner etal, 2013). MALATl 
is very abundantly expressed (higher than many housekeeping 
genes) in multiple cell types, including the pancreas (Ji et al., 2003) 
and in purified human a- and P-cells (Dorrell etal, 2011). Addi- 
tionally, MALATl is encoded within an active enhancer cluster with 
several binding sites for islet-transcription factors (Pasquali et al., 
2014), making this is an intriguing candidate for gene regulation 
in human islets. 

Metastasis-associated lung adenocarcinoma transcript 1 has 
several interacting partners through which it may mediate its 
function. One such interacting partner is DGCR8, a double- 
stranded RNA binding protein that together with Drosha mediates 
miRNA bioprocessing (Macias etal, 2012). MALATl was found 
to be bound to Argonaute (Ago), the primary effector of miRNA 
function in HeLa cells (Weinmann etal, 2009). MALATl was 
also found to be associated with Ago in human islets, suggest- 
ing that this IncRNA may be regulated by miRNAs in human cells 
(Kameswaran et al., 2014). In fact, we discovered several sequences 
that consisted of miRNAs fused to MALATl while assaying miR- 
NAs and their targets that were bound to Ago in human islets. 
These chimeric reads were the result of ligation of two adjacent 
RNA species present in the RISC complex with Ago (Helwak et al., 
2013), and proved that MALATl is regulated by several miRNAs 
in human islets (Kameswaran etal., 2014). 

Metastasis-associated lung adenocarcinoma transcript 1 can also 
regulate gene expression through its association with different 
nuclear sub-compartments (Hutchinson etal., 2007; Yang etal., 
2011; Gutschner etal, 2013). One example of this is MALATl 
localization in nuclear speckles, which are nuclear domains where 
splicing factors are stored and post-transcriptionally modified 
(Hutchinson etal., 2007; Mao etal, 2011). Through the modi- 
fication of critical splicing factors, MALATl has been shown to 
contribute to alternative splicing (Tripathi etal, 2010). However, 
despite the abundance of this IncRNA and the early suggestions of 
its function from in vitro studies, mice lacking MALATl displayed 
no obvious phenotype in the absence of additional pathological 
stressors and exhibit largely normal nuclear speckle formation and 
alternative splicing patterns (Eiiimann et al., 2012; Nakagawa et al., 
2012; Zhang etal., 2012). Thus, the role of this IncRNA remains to 
be determined. 

PERSPECTIVE 

The exciting discovery of IncRNAs and the growing recognition of 
their involvement in human pathogenesis have added a new level 
of complexity to our understanding of gene regulation. However, 
due to the range of sequencing and bioinformatic tools currently 
available, the rate of discovery of new IncRNAs has surpassed our 
ability to examine their function. This gap between IncRNA gene 
discovery and function currently holds true in the field of P-ceU 
biology as well, necessitating the systematic analysis of mouse and 
human islet IncRNAs identified to date (Ku etal., 2012; Moran 
etal, 2012). 

Factors such as overlap between the human and mouse a- and 
P-cell IncRNA complements (Ku etal, 2012; Moran etal., 2012; 
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Bramswig etal., 2013), degree of conservation, expression, asso- 
ciated protein-coding genes, and relative distance from GWAS 
SNP variants may be good early predictors of important IncRNAs. 
However, these parameters alone may underestimate other essen- 
tial candidates, as some IncRNAs exhibit low primary sequence 
conservation despite crucial function (Nesterova etal., 2001), or, 
conversely, a dispensable function despite high sequence conser- 
vation and expression (Zhang etal, 2012). These observations 
emphasize the need for careful loss-of-function experiments in 
appropriate model systems induced by metabolic and/or inflam- 
matory challenges to clearly understand the function of these 
IncRNAs. Although many of the human P-ceU IncRNAs are 
expressed in the EndoC-fiHl cell line that somewhat resembles 
human P-cells in vitro (Ravassard etal., 20 11), targeted deletion or 
inhibition in mouse and human islets may be necessary in some 
cases to reveal their function, as seen in the example of HI-LNC25 
discussed above (Moran et al, 2012). 

While the loss-of-function of even abundant IncRNAs such as 
MALATl may sometimes result in a lack of phenotype (Eifimann 
et al., 2012; Nakagawa et al., 2012; Zhang et al., 2012), lessons from 
the miRNA field suggest that additional physiological and environ- 
mental stressors maybe necessary to truly elucidate the function of 
these non-coding RNAs (Mendell and Olson, 2012). Additionally, 
in order to study the role of IncRNAs in the context of loss-of- 
function, a careful analysis of the genomic location of the IncRNAs 
may be required to evaluate the best method of gene silencing, as 
targeted recombination may result in disruption of overlapping 
protein-coding transcripts or their regulatory domains, further 
confounding data interpretation. 

Given the broad range of human diseases that IncRNAs are now 
associated with, it is perhaps not surprising that there is growing 
evidence for their role in ^-cell function and diabetes pathogenesis. 
Revealing their function will undoubtedly lead to a new wave of 
exciting targets to explore for therapeutic development. 
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